1.  Summary

Let $X_{i1}, \ldots, X_{im_i}$ and $Y_{i1}, \ldots, Y_{in_i}$; $i = 1, \ldots, k$ be k pairs of samples of mutually independent observations from continuous distribution functions $F_i(x)$ and $G_i(y)$ respectively; $i = 1, \ldots, k$. Then for testing the hypothesis $F_i = G_i$; $i = 1, \ldots, k$, test statistics of the form (i) $T = \sum_{i=1}^{k} c_i t_i$ and (ii) $Q = \sum_{i=1}^{k} c_i Q_i$ are considered. Here the $c_i$ are weights which may depend upon the sample sizes, $t_i$ is Student's t statistic for testing the equality of means between two normal populations with the same variance, corresponding to the i-th pair of samples, and $Q_i$ is the Chernoff-Savage statistic (1958) for the i-th pair of samples. Under suitable assumptions, the weights $c_i$ which maximize the local asymptotic powers of the tests (i) and (ii) are obtained. These results are specialized to (a) Pitman's shift alternatives, (b) Lehmann's distribution-free alternatives and (c) contaminated alternatives. Finally, the asymptotic efficiencies of the Q test relative to some of its parametric as well as non-parametric competitors against the above-mentioned alternatives are discussed.

2.  Introduction 


It frequently happens that several independent test statistics are available for testing the same null hypothesis. These may have arisen from several sets of independent samples which cannot be combined, perhaps because they are reported by different investigators or because they have not all been gathered under the same conditions. In such situations, it is often considered reasonable to combine the various results into a single measure on which an objective judgment of the evidence as a whole can be based. One measure is advanced by Fisher (1932). He proposed as a test statistic the product of the tail probabilities of the individual tests. It turns out that -2 times the logarithm of this product has a chi-square distribution with 2k degrees of freedom when the null hypothesis is true, k being the number of tests. For a detailed discussion of Fisher's method, the reader is referred to the paper of Wallis (1942). General discussion of combining independent tests can also be found in Birnbaum (1954) and Pearson (1938).
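Fisher's rule described above can be sketched in a few lines. This is only an illustration: the function name `fisher_combined_pvalue` and the example p-values are assumptions, not part of the paper. The chi-square tail with 2k degrees of freedom is evaluated through its closed form for even degrees of freedom, so only the standard library is needed.

```python
import math


def fisher_combined_pvalue(p_values):
    """Combine k independent tail probabilities by Fisher's method."""
    k = len(p_values)
    # -2 times the logarithm of the product of the tail probabilities.
    statistic = -2.0 * sum(math.log(p) for p in p_values)
    # Upper tail of chi-square with 2k d.f.: exp(-x/2) * sum_{j<k} (x/2)^j / j!
    half = statistic / 2.0
    tail = math.exp(-half) * sum(half**j / math.factorial(j) for j in range(k))
    return statistic, tail


# Hypothetical p-values from three independent tests of the same hypothesis.
stat, p_combined = fisher_combined_pvalue([0.08, 0.11, 0.04])
print(f"statistic = {stat:.3f}, combined p-value = {p_combined:.4f}")
```

With a single test (k = 1) the combined tail probability reduces to the original one, which is a convenient sanity check.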
Recently, an interesting technique was advanced by Ph. van Elteren (1960). He analyses a class of tests based on linear combinations $\sum_{i=1}^{k} c_i W_i$ of the test statistics $W_1, \ldots, W_k$ of k independent two-sample Wilcoxon tests. He considers in particular two special linear combinations, namely (i) $c_i = c/(m_i n_i)$ and (ii) $c_i = c/(m_i + n_i + 1)$, where c is a positive real number and $m_i$, $n_i$ are the sample sizes of the i-th set, and shows that the test (i) has a region of consistency independent of the sample sizes and that the test (ii) has asymptotically the maximum power. In this paper, we consider a similar problem in a more general framework which includes as a special case the problem considered by Ph. van Elteren (1960), mentioned above. Precisely, we consider the following problem.

3.   Problem 

Let $X_i$, $Y_i$; $i = 1, \ldots, k$ be k pairs of independent stochastic variables about whose cumulative distribution functions nothing is known, except that they are continuous. Further suppose that samples of independent observations are available and let their sizes be denoted by $m_i$ for $X_i$ and $n_i$ for $Y_i$; $i = 1, \ldots, k$. It is desired to test the hypothesis that the distribution functions of $X_i$ and $Y_i$; $i = 1, \ldots, k$ are identical.

4.   Assumptions  and  Notations 

Let $X_{i1} < \ldots < X_{im_i}$ be the ordered observations of a random sample from a population with continuous c.d.f. (cumulative distribution function) $F_i(x)$. Let $Y_{i1} < \ldots < Y_{in_i}$ be the ordered observations of a random sample from a population with continuous c.d.f. $G_i(x) = F_i(x;\theta)$, where $\theta$ is very close to zero, and suppose that for almost all x, $G_i(x)$ is continuous with respect to $\theta$. Let $N_i = m_i + n_i$ and suppose that for all $N_i$ the inequalities

$$0 < \lambda_0 \le \lambda_i = \frac{m_i}{N_i} \le 1 - \lambda_0 < 1$$

hold for some fixed $\lambda_0 \le 1/2$; $i = 1, \ldots, k$.

Let

$$S_{m_i}(x) = \frac{1}{m_i}\sum_{j=1}^{m_i} I[X_{ij} \le x]$$

and

$$V_{n_i}(x) = \frac{1}{n_i}\sum_{j=1}^{n_i} I[Y_{ij} \le x],$$

where $I[X \le x]$ denotes the indicator of the event $[X \le x]$. Thus $S_{m_i}(x)$ and $V_{n_i}(x)$ are the sample c.d.f.'s of the $X_i$'s and $Y_i$'s respectively; $i = 1, \ldots, k$. The combined sample c.d.f. for the i-th pair is $H_{N_i}(x)$, defined as

$$H_{N_i}(x) = \lambda_i S_{m_i}(x) + (1-\lambda_i) V_{n_i}(x),$$

and the combined population c.d.f. for the i-th pair is

$$H_i(x) = \lambda_i F_i(x) + (1-\lambda_i) G_i(x).$$

Let $Z^{(i)}_{N_i j} = 1$ if the j-th smallest observation in the combined sample of the i-th pair comes from $X_i$, and otherwise let $Z^{(i)}_{N_i j} = 0$. Then the Chernoff-Savage statistic (1958) for the i-th pair of samples is

(4.1)  $$m_i Q_i = \sum_{j=1}^{N_i} E^{(i)}_{N_i j}\, Z^{(i)}_{N_i j},$$

where the $E^{(i)}_{N_i j}$ are given numbers. Note that Wilcoxon's statistic for the i-th pair of samples is obtained from (4.1) by letting $E^{(i)}_{N_i j} = j/N_i$, and the normal scores statistic for the corresponding samples by letting $E^{(i)}_{N_i j} = E(V^{(i)}_{(j)})$, where $V^{(i)}_{(1)} < \ldots < V^{(i)}_{(N_i)}$ is an ordered sample of size $N_i$ from a standard normal distribution.

Following Chernoff-Savage (1958), we shall use the following equivalent form of $Q_i$:

(4.2)  $$Q_i = \int_{-\infty}^{+\infty} J_{N_i}[H_{N_i}(x)]\, dS_{m_i}(x); \quad i = 1, \ldots, k,$$

where $E^{(i)}_{N_i j} = J_{N_i}(j/N_i)$.

The function $J_{N_i}$ need be defined only at $1/N_i, 2/N_i, \ldots, N_i/N_i$, but its domain of definition may be extended to (0,1] by letting $J_{N_i}$ be constant on $(j/N_i, (j+1)/N_i]$.

In this paper, we consider statistics of the form

(4.3)  $$Q = \sum_{i=1}^{k} c_i Q_i,$$

where the c's are real positive numbers and may depend upon the sample sizes.
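The definitions (4.1) and (4.3) translate directly into a small computation. The sketch below is an illustration only: the function names, the sample data and the choice of weights $c_i = m_i$ are assumptions made for the example, and ties are not handled because the paper assumes continuous distributions.

```python
def wilcoxon_score(j, total):
    """Wilcoxon scores E_j = j / N_i."""
    return j / total


def chernoff_savage_q(x, y, score=wilcoxon_score):
    """Q_i = (1/m_i) * sum_j E_j * Z_j for one pair of samples (ties not handled)."""
    m, total = len(x), len(x) + len(y)
    # Tag each observation: 1 if it comes from the X sample, 0 otherwise.
    combined = sorted([(v, 1) for v in x] + [(v, 0) for v in y])
    return sum(score(j, total) * z for j, (_, z) in enumerate(combined, start=1)) / m


def combined_q(pairs, weights):
    """Q = sum_i c_i * Q_i over k independent pairs of samples."""
    return sum(c * chernoff_savage_q(x, y) for c, (x, y) in zip(weights, pairs))


# Two hypothetical pairs of samples; weights c_i = m_i are an illustrative choice.
pairs = [([1.2, 3.4, 2.2], [0.5, 2.9]), ([5.0, 7.0], [6.0])]
weights = [len(x) for x, _ in pairs]
print(combined_q(pairs, weights))
```

Passing a different `score` function (for instance expected normal order statistics) gives other members of the Chernoff-Savage class without changing the rest of the code.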


We may test the hypothesis $H_0$: $F_i(x) = G_i(x)$; $i = 1, \ldots, k$ by means of the critical region $Q \ge Q_\alpha$, where $Q_\alpha$ is given by

$$P_{H_0}(Q \ge Q_\alpha) = \alpha,$$

where $\alpha$ is the level of significance. If the distribution of Q under $H_0$ is symmetric with respect to the origin, then the corresponding left-sided test will have the critical region $Q \le -Q_\alpha$ and the two-sided test will have the critical region $|Q| \ge Q_{\alpha/2}$.
5.   General  Properties  of  the  distribution  of  Q 

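In what follows, we make the following assumptions:

(1)  $J(H) = \lim_{N\to\infty} J_N(H)$ exists for $0 < H < 1$ and is not constant.

(2)  $\int_{I_{N_i}} [J_{N_i}(H_{N_i}) - J(H_{N_i})]\, dS_{m_i}(x) = o_p(N_i^{-1/2})$ (footnote 3); $i = 1, \ldots, k$, where $I_{N_i} = \{x : 0 < H_{N_i}(x) < 1\}$.

(3)  $J_{N_i}(1) = o(\sqrt{N_i})$.

(4)  $|J^{(r)}(H)| = \left|\frac{d^r J}{dH^r}\right| \le K[H(1-H)]^{-r-(1/2)+\delta}$ for r = 0, 1, 2 and for some $\delta > 0$ and some K.

Then the application of the Chernoff-Savage theorem (1958) yields

(5.1)  $$\lim_{N_i\to\infty} P\left(\frac{Q_i - \mu_i(\theta)}{\sigma_i(\theta)} \le x\right) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{x} e^{-t^2/2}\, dt,$$

where

(5.2)  $$\mu_i(\theta) = \int_{-\infty}^{+\infty} J[H_i(x)]\, dF_i(x)$$

and

(5.3)  $$N_i\sigma_i^2(\theta) = 2(1-\lambda_i)\left\{\iint_{-\infty<x<y<\infty} G_i(x)[1-G_i(y)]\, J'[H_i(x)]\, J'[H_i(y)]\, dF_i(x)\, dF_i(y) + \frac{1-\lambda_i}{\lambda_i}\iint_{-\infty<x<y<\infty} F_i(x)[1-F_i(y)]\, J'[H_i(x)]\, J'[H_i(y)]\, dG_i(x)\, dG_i(y)\right\},$$

provided $\sigma_i(\theta) \ne 0$.

Thus

(5.4)  $$\mu(0) = E_{H_0}(Q) = \sum_{i=1}^{k} c_i a_i,$$

where

(5.5)  $$a_i = \int_{-\infty}^{+\infty} J[F_i(x)]\, dF_i(x),$$

(5.6)  $$\sigma^2(0) = \mathrm{var}_{H_0}(Q) = \sum_{i=1}^{k} c_i^2\, \frac{n_i}{m_i N_i}\, A^2,$$

where

(5.7)  $$A^2 = \int_0^1 J^2(x)\, dx - \left(\int_0^1 J(x)\, dx\right)^2,$$

(5.8)  $$\mu(\theta) = E(Q) = \sum_{i=1}^{k} c_i \mu_i(\theta),$$

(5.9)  $$\sigma^2(\theta) = \mathrm{var}(Q) = \sum_{i=1}^{k} c_i^2 \sigma_i^2(\theta),$$

where $\mu_i(\theta)$ and $\sigma_i(\theta)$ are given by (5.2) and (5.3) respectively.

By the Central Limit Theorem, the distribution of Q will be approximately normal.

It follows that the critical value $Q_\alpha$ is approximately equal to

(5.10)  $$Q_\alpha = \mu(0) + \lambda_\alpha A \sqrt{\sum_{i=1}^{k} c_i^2\, \frac{n_i}{m_i N_i}},$$

where $\lambda_\alpha$ is defined by

(5.11)  $$\int_{\lambda_\alpha}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}\, dx = \alpha,$$

and the power of the Q test with respect to a given set of distribution functions $F_i(x)$ and $G_i(x)$ is approximately equal to

(5.12)  $$P(Q \ge Q_\alpha) = 1 - \Phi\left(\frac{Q_\alpha - \mu(\theta)}{\sigma(\theta)}\right),$$

where $\Phi(x)$ is the standard normal distribution function.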
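The normal approximation to the critical value and to the power can be illustrated numerically. The sketch below assumes that the means and standard deviations of Q under the null hypothesis and under the alternative are already available as numbers; the function name `approx_power` and the numerical inputs are hypothetical.

```python
from statistics import NormalDist


def approx_power(alpha, mu_theta, mu_0, sigma_theta, sigma_0):
    """Normal-approximation power of the one-sided test Q >= Q_alpha."""
    std_normal = NormalDist()
    # Upper alpha-point of the standard normal distribution, as in (5.11).
    lam = std_normal.inv_cdf(1.0 - alpha)
    # Approximate critical value: null mean plus lam times null standard deviation.
    critical = mu_0 + lam * sigma_0
    return 1.0 - std_normal.cdf((critical - mu_theta) / sigma_theta)


# Hypothetical moments of Q under the null hypothesis and under an alternative.
print(approx_power(alpha=0.05, mu_theta=1.3, mu_0=1.0, sigma_theta=0.12, sigma_0=0.12))
```

When the alternative coincides with the null hypothesis the function returns the level $\alpha$, as it should.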

6.   Locally  Best  Q  Test 

From this section onward, we assume that $m_i$, $n_i$ and k are non-decreasing functions of a natural number n that tends to infinity. The dependence on n is indicated, when necessary, by writing $m_i(n)$, $n_i(n)$, $k(n)$, $\mu^{(n)}(\theta)$, etc. We shall consider the following two special cases:

Case 1:  $m_i(n)$ and $n_i(n)$ tend to infinity as n tends to infinity but $m_i(n)/n$ and $n_i(n)/n$ remain bounded away from zero; $k(n) = k$ for each n.

Case 2:  $m_i(n)$ and $n_i(n)$ remain constant and $k(n)$ tends to infinity as n tends to infinity. For simplicity, we assume that $m_i(n) = m_i$, $n_i(n) = n_i$ and $k(n) = n$.

Furthermore, we make the following assumption:

Assumption 6.1:

For sufficiently large n,

$$\sqrt{n}\int_{-\infty}^{+\infty} \left[J[H_i(x;n)] - J[F_i(x;n)]\right] dF_i(x;n) \Big/ A$$

remains bounded as n tends to infinity. Then we prove the following

Theorem 6.1.

For each index n, assume the validity of case 1 and assumption 6.1. Then the Q test with

(6.1)  $$c_i(n) = c\, \frac{d_i(n)\, m_i(n)\, N_i(n)}{n_i(n)},$$

where

(6.2)  $$d_i(n) = \int_{-\infty}^{+\infty} \left[J[H_i(x;n)] - J[F_i(x;n)]\right] dF_i(x;n)$$

and c is an arbitrary positive constant, has for $n \to \infty$ asymptotically the largest power against all alternatives for which the $d_i(n)$ are positive.
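Proof.  We shall first prove that

$$\frac{\sigma^{(n)}(\theta)}{\sigma^{(n)}(0)} \to 1 \quad \text{as } n \to \infty.$$

For this it suffices to show that $\sigma_i^{2(n)}(\theta)$ is continuous at $\theta = 0$, uniformly in n. Consider first the first integral on the right hand side of (5.3) and let it be denoted by $A_i(\theta)$. Thus

(6.3)  $$A_i(\theta) = \iint_{-\infty<x<y<\infty} G_i(x)[1-G_i(y)]\, J'[H_i(x)]\, J'[H_i(y)]\, dF_i(x)\, dF_i(y).$$

Setting $F_i(x) = u$ and $F_i(y) = v$, we rewrite (6.3) as

(6.4)  $$A_i(\theta) = \iint_{0<u<v<1} G_i^*(u)[1-G_i^*(v)]\, J'[H_i^*(u)]\, J'[H_i^*(v)]\, du\, dv,$$

where $G_i^*(u) = G_i[F_i^{-1}(u)]$ and $H_i^*(u) = \lambda_i u + (1-\lambda_i) G_i^*(u)$.

It is clear that the integrand is continuous at $\theta = 0$ for almost all u and v.

Furthermore, since

$$G_i^*(u) \le \frac{1}{1-\lambda_i} H_i^*(u), \qquad 1 - G_i^*(v) \le \frac{1}{1-\lambda_i}[1 - H_i^*(v)],$$

$$|J'[H_i^*(u)]| \le K\{H_i^*(u)[1-H_i^*(u)]\}^{-3/2+\delta},$$

we have, from (6.4),

(6.5)  $$\left|G_i^*(u)[1-G_i^*(v)]\, J'[H_i^*(u)]\, J'[H_i^*(v)]\right| \le \frac{K^2}{(1-\lambda_i)^2}\, [H_i^*(u)]^{-1/2+\delta}\, [1-H_i^*(v)]^{-1/2+\delta}\, [H_i^*(v)]^{-3/2+\delta}\, [1-H_i^*(u)]^{-3/2+\delta}.$$

We may assume $\delta < 1/2$, without loss of generality. Then, since $H_i^*(u) \ge \lambda_i u$ and $1 - H_i^*(u) \ge \lambda_i(1-u)$, it follows from (6.5) that

$$\left|G_i^*(u)[1-G_i^*(v)]\, J'[H_i^*(u)]\, J'[H_i^*(v)]\right| \le \frac{K^2}{(1-\lambda_i)^2}\, \lambda_i^{-4+4\delta}\, u^{-1/2+\delta}\, (1-v)^{-1/2+\delta}\, v^{-3/2+\delta}\, (1-u)^{-3/2+\delta}.$$

Hence by Cramér ([1957], p. 67) $A_i(\theta)$ is continuous at $\theta = 0$. Similarly the second integral on the right side of (5.3) is continuous at $\theta = 0$. Hence $\sigma_i^{2(n)}(\theta)$, and so a fortiori $\sigma^{2(n)}(\theta)$, is continuous at $\theta = 0$.

Next, because of assumption 6.1,

$$\frac{\mu^{(n)}(\theta) - \mu^{(n)}(0)}{\sigma^{(n)}(0)} = O(1).$$

Hence the power of the Q test can be approximated by

(6.6)  $$1 - \Phi\left(\lambda_\alpha - \frac{\mu^{(n)}(\theta) - \mu^{(n)}(0)}{\sigma^{(n)}(0)}\right).$$

This is maximum when

(6.7)  $$\frac{\mu^{(n)}(\theta) - \mu^{(n)}(0)}{\sigma^{(n)}(0)} = \frac{\sum_{i=1}^{k} c_i(n) \int_{-\infty}^{+\infty} \left[J[H_i(x;n)] - J[F_i(x;n)]\right] dF_i(x;n)}{A\sqrt{\sum_{i=1}^{k} c_i^2(n)\, n_i(n) / [m_i(n) N_i(n)]}}$$

is maximum, which is so, by the Cauchy-Schwarz inequality, when $c_i(n)$ is as defined in (6.1). This completes the proof of the theorem.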
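The maximization at the end of the proof can be checked numerically. The sketch below evaluates a ratio of the form (sum of $c_i d_i$) divided by A times the square root of (sum of $c_i^2 n_i / (m_i N_i)$), and compares weights proportional to $d_i m_i N_i / n_i$ with randomly drawn positive weights. All sample sizes and $d_i$ values are made-up illustrative numbers, and the function names are assumptions.

```python
import math
import random


def efficacy(c, d, m, n, a=1.0):
    """Ratio sum(c_i d_i) / (A * sqrt(sum(c_i^2 n_i / (m_i N_i))))."""
    numerator = sum(ci * di for ci, di in zip(c, d))
    variance = sum(ci**2 * ni / (mi * (mi + ni)) for ci, mi, ni in zip(c, m, n))
    return numerator / (a * math.sqrt(variance))


def optimal_weights(d, m, n, scale=1.0):
    """Weights proportional to d_i * m_i * N_i / n_i."""
    return [scale * di * mi * (mi + ni) / ni for di, mi, ni in zip(d, m, n)]


# Illustrative (hypothetical) sample sizes and positive d_i values.
m = [10, 25, 40]
n = [15, 20, 60]
d = [0.05, 0.03, 0.08]

best = efficacy(optimal_weights(d, m, n), d, m, n)
rng = random.Random(0)
trials = [efficacy([rng.uniform(0.1, 5.0) for _ in m], d, m, n) for _ in range(1000)]
print(best >= max(trials))
```

The ratio is unchanged when all weights are multiplied by the same positive constant, which is why the constant c in the weights can be chosen arbitrarily.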

7.   Computation of $d_i(n)$


The computation of $d_i(n)$ depends heavily upon the sequences of alternatives we have in mind. In the subsequent analysis, we shall concern ourselves with three sequences of admissible alternative hypotheses, viz. $H_n^P$, $H_n^L$ and $H_n^C$. The hypothesis $H_n^P$ specifies that for each $i = 1, \ldots, k$, $G_i(x) = F(x + \gamma_i + \xi/\sqrt{n})$; the hypothesis $H_n^L$ specifies that for each $i = 1, \ldots, k$, $G_i(x) = [F(x+\gamma_i)]^{1-\xi/\sqrt{n}}$; and the hypothesis $H_n^C$ specifies that for each $i = 1, \ldots, k$, $X_i$ has the distribution function $F(x+\gamma_i)$ and $Y_i$ has the distribution function $(1 - \xi/\sqrt{n})F(x+\gamma_i) + (\xi/\sqrt{n})G(x+\gamma_i)$; here $\gamma_i$ is a real number, $\xi$ is a finite positive constant independent of i, and $F_i(x) = F(x+\gamma_i)$. Alternatives of the form $H_n^P$ were introduced by Pitman, those of the form $H_n^L$ by Lehmann (1953) in order to study non-parametric procedures when the alternatives themselves are given in a non-parametric form. For an extensive study of Lehmann's alternatives, the reader is referred to an interesting paper of Savage (1956). Alternatives of the form $H_n^C$ are referred to as contaminated alternatives, which have been considered by Hodges and Lehmann (1956) among others.

We shall, therefore, compute $d_i(n)$ and hence $c_i(n)$ for the above-mentioned classes of alternatives. We shall make use of a lemma due to Hodges and Lehmann (1961), and the reader is referred to this reference regarding it. A consequence of this lemma, in a form appropriate for our purpose, may be stated as follows:

Lemma 7.1. (Hodges-Lehmann).

If

(i)  F is a continuous cumulative distribution function, differentiable in each of the open intervals $(-\infty, a_1), (a_1, a_2), \ldots, (a_{s-1}, a_s), (a_s, \infty)$, and the derivative of F is bounded in each of these intervals, and either

(ii)  for the alternatives $H_n^P$ or $H_n^C$, the function $\frac{d}{dx}J[F(x)]$ is bounded as $x \to \pm\infty$, or

(ii')  for the alternatives $H_n^L$, the function $F(x)\log F(x)\,\frac{d}{dx}J[F(x)]$ is bounded as $x \to \pm\infty$,

then

(7.1)  $$\sqrt{n}\, d_i(n) \to \xi(1-\lambda_i)\int_{-\infty}^{+\infty} \frac{d}{dx}J[F(x)]\, dF(x)$$

in case the hypothesis $H_n^P$ is valid,

(7.2)  $$\sqrt{n}\, d_i(n) \to \xi(1-\lambda_i)\int_{-\infty}^{+\infty} -F(x)\log F(x)\, \frac{d}{dx}J[F(x)]\, dx$$

in case the hypothesis $H_n^L$ is valid, and

(7.3)  $$\sqrt{n}\, d_i(n) \to \xi(1-\lambda_i)\int_{-\infty}^{+\infty} [G(x) - F(x)]\, J'[F(x)]\, dF(x)$$

in case the hypothesis $H_n^C$ is valid.

The proof of this lemma follows by the method used in sections 3 and 4 of Hodges-Lehmann (1961).

In order to save space the details are omitted.
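The limit for shift alternatives can be checked in one case where everything is available in closed form. The sketch below assumes Wilcoxon scores, J(u) = u, and F equal to the standard normal distribution, so that $d_i(n) = (1-\lambda_i)[P(Y < X) - 1/2]$ with $P(Y < X) = \Phi(\xi/\sqrt{2n})$, and the limiting value is $\xi(1-\lambda_i)\int f^2(x)\,dx = \xi(1-\lambda_i)/(2\sqrt{\pi})$. The function names and the numerical inputs are illustrative assumptions.

```python
import math
from statistics import NormalDist


def sqrt_n_d(n, xi, lam):
    """sqrt(n) * d_i(n) for J(u) = u, F standard normal, G(x) = F(x + xi / sqrt(n))."""
    delta = xi / math.sqrt(n)
    # For J(u) = u: d_i(n) = (1 - lam) * (P(Y < X) - 1/2), with P(Y < X) = Phi(delta / sqrt(2)).
    prob = NormalDist().cdf(delta / math.sqrt(2.0))
    return math.sqrt(n) * (1.0 - lam) * (prob - 0.5)


def shift_limit(xi, lam):
    """xi * (1 - lam) * integral of f(x)^2 dx, equal to 1 / (2 sqrt(pi)) for the standard normal."""
    return xi * (1.0 - lam) / (2.0 * math.sqrt(math.pi))


for n in (10, 1000, 10**6):
    print(n, sqrt_n_d(n, xi=1.5, lam=0.4), shift_limit(1.5, 0.4))
```

The printed pairs approach each other as n grows, which is the behaviour the lemma asserts for this particular choice of J and F.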

Now the quantities $\xi\int \frac{d}{dx}J[F(x)]\, dF(x)$, $\xi\int -F(x)\log F(x)\, \frac{d}{dx}J[F(x)]\, dx$ and $\xi\int [G(x)-F(x)]\, J'[F(x)]\, dF(x)$, being constants, can be absorbed into the constant c of (6.1), with the result that we have $c_i(n) = c\, m_i(n)$, since $(1-\lambda_i)\, m_i(n) N_i(n)/n_i(n) = m_i(n)$. Thus the material discussed in this section, coupled with that discussed in the previous section, yields the following

Theorem 7.1.

For each index n, assume the validity of the hypotheses $H_n^P$ or $H_n^L$ or $H_n^C$ and the assumptions of lemma 7.1. Then for case 1, the Q test with weights $c_i(n) = c\, m_i(n)$, where c is an arbitrary positive constant, has asymptotically the maximum power.

In what follows, we shall denote the locally best Wilcoxon form of the Q test by the symbol $Q_W$, and we shall call it the locally best $Q_W$ test. Thus

$$Q_W = \sum_{i=1}^{k} m_i(n)\, Q_i^{(W)},$$

where $Q_i^{(W)}$ is obtained from (4.1) by letting $E^{(i)}_{N_i j} = j/N_i$.

8.  Relation between Elteren's W test and the locally best $Q_W$ test

Let $X_{i,r}$ and $Y_{i,s}$ denote the r-th and the s-th observations of $X_i$ and $Y_i$ respectively; $r = 1, \ldots, m_i$; $s = 1, \ldots, n_i$.

Denote

$$\mathrm{sgn}(X_{i,r} - Y_{i,s}) = \begin{cases} 1 & \text{if } X_{i,r} - Y_{i,s} > 0, \\ 0 & \text{if } X_{i,r} - Y_{i,s} = 0, \\ -1 & \text{if } X_{i,r} - Y_{i,s} < 0; \end{cases}$$

then Elteren's locally best W test [cf. Elteren, Ph. van (1960)] for case 1 as well as case II is defined as

(8.1)  $$W = c\sum_{i=1}^{k(n)} W_i / [m_i(n) + n_i(n) + 1],$$

where

(8.2)  $$W_i = \sum_{r=1}^{m_i}\sum_{s=1}^{n_i} \mathrm{sgn}(X_{i,r} - Y_{i,s}),$$

which is equivalent to Wilcoxon's statistic [cf. Wilcoxon (1945)] for the i-th pair of samples.

It is easy to check that

(8.3)  $$W_i = 2 m_i N_i Q_i^{(W)} - m_i(N_i + 1),$$

so that

(8.4)  $$W = 2c\sum_{i=1}^{k} \frac{m_i N_i}{m_i + n_i + 1}\, Q_i^{(W)} - c\sum_{i=1}^{k} m_i.$$

Hence, asymptotically, the following linear relation exists between the W statistic and the $Q_W$ statistic:

(8.5)  $$W = 2c\, Q_W - c\sum_{i=1}^{k} m_i.$$

In our subsequent analysis, we shall use the following expressions connected with Elteren's W test:

(8.6)  $$\mu(\theta) = E(W) = 2c\sum_{i=1}^{k} \frac{m_i n_i}{m_i + n_i + 1}\int_{-\infty}^{+\infty} [G_i(x) - F_i(x)]\, dF_i(x),$$

(8.7)  $$\sigma^2(0) = \mathrm{var}_{H_0}(W) = \frac{1}{3}\, c^2\sum_{i=1}^{k} \frac{m_i n_i}{m_i + n_i + 1}.$$
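The exact relation between the sign-based statistic for one pair of samples and the Wilcoxon form of the Chernoff-Savage statistic, $W_i = 2 m_i N_i Q_i - m_i(N_i + 1)$, can be verified on a small example. The sketch below uses made-up data and hypothetical function names, and assumes there are no ties.

```python
def sign(v):
    """Return 1, 0 or -1 according to the sign of v."""
    return (v > 0) - (v < 0)


def w_i(x, y):
    """Sum of sgn(X_r - Y_s) over all pairs of observations from one pair of samples."""
    return sum(sign(xr - ys) for xr in x for ys in y)


def wilcoxon_q(x, y):
    """Chernoff-Savage statistic with Wilcoxon scores j / N (ties not handled)."""
    m, total = len(x), len(x) + len(y)
    combined = sorted([(v, 1) for v in x] + [(v, 0) for v in y])
    return sum(j / total for j, (_, z) in enumerate(combined, start=1) if z) / m


def van_elteren_w(pairs, c=1.0):
    """W = c * sum_i W_i / (m_i + n_i + 1)."""
    return c * sum(w_i(x, y) / (len(x) + len(y) + 1) for x, y in pairs)


x, y = [1.2, 3.4, 2.2], [0.5, 2.9]  # hypothetical observations
m, total = len(x), len(x) + len(y)
print(w_i(x, y), 2 * m * total * wilcoxon_q(x, y) - m * (total + 1))
print(van_elteren_w([(x, y), ([5.0, 7.0], [6.0])]))
```

The two numbers on the first printed line agree up to floating-point rounding, because the identity is exact for every pair of samples without ties.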


9.   On the Combination of Independent Two-Sample Tests based on Student's t-statistic.

Let $F_i(x)$ and $G_i(x)$ be normal distribution functions with the same variance $\sigma^2$. Then Student's t-test for the i-th pair of samples is based on the statistic

(9.1)  $$t_i = \frac{(\bar{X}_i - \bar{Y}_i) \Big/ \left(\sigma\sqrt{\frac{1}{m_i} + \frac{1}{n_i}}\right)}{\sqrt{\left[\sum_{r=1}^{m_i}(X_{i,r} - \bar{X}_i)^2 + \sum_{s=1}^{n_i}(Y_{i,s} - \bar{Y}_i)^2\right] \Big/ \left[(m_i + n_i - 2)\sigma^2\right]}},$$

where

(9.2)  $$\bar{X}_i = \sum_{r=1}^{m_i} X_{i,r}/m_i \quad \text{and} \quad \bar{Y}_i = \sum_{s=1}^{n_i} Y_{i,s}/n_i.$$

But since the denominator of $t_i$ tends to one in probability, an asymptotically equivalent statistic is

(9.3)  $$t_i^* = (\bar{X}_i - \bar{Y}_i) \Big/ \left(\sigma\sqrt{\frac{1}{m_i} + \frac{1}{n_i}}\right),$$

which has a normal distribution. Now proceeding as in sections 6 and 7, we conclude

Theorem 9.1.

For each index n, assume the validity of the hypotheses $H_n^P$, $H_n^L$ or $H_n^C$. Then the t-test with weights

$$c_i(n) = c\sqrt{\frac{m_i(n)\, n_i(n)}{N_i(n)\, \sigma^2}}$$

has for $n \to \infty$ asymptotically the largest power (c is an arbitrary positive constant).
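A small sketch of the weighted combination of two-sample t statistics follows. It computes the usual pooled-variance t statistic for each pair (the common $\sigma$ cancels in the ratio) and weights it by $c\sqrt{m_i n_i/(N_i\sigma^2)}$. The function names, the data, and the defaults $\sigma = 1$ and $c = 1$ are assumptions made only for illustration.

```python
import math
from statistics import mean


def pooled_t(x, y):
    """Two-sample Student t statistic with pooled variance, based on mean(x) - mean(y)."""
    m, n = len(x), len(y)
    xbar, ybar = mean(x), mean(y)
    ss = sum((v - xbar) ** 2 for v in x) + sum((v - ybar) ** 2 for v in y)
    pooled_var = ss / (m + n - 2)
    return (xbar - ybar) / math.sqrt(pooled_var * (1.0 / m + 1.0 / n))


def t_weight(m, n, sigma=1.0, c=1.0):
    """Weight c * sqrt(m n / (N sigma^2)) for one pair of samples."""
    return c * math.sqrt(m * n / ((m + n) * sigma**2))


def combined_t(pairs, sigma=1.0, c=1.0):
    """T = sum_i c_i t_i over independent pairs of samples."""
    return sum(t_weight(len(x), len(y), sigma, c) * pooled_t(x, y) for x, y in pairs)


# Hypothetical data for two pairs of samples.
pairs = [([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]), ([5.1, 4.8, 5.6, 5.0], [4.2, 4.9, 4.4])]
print(combined_t(pairs))
```

In practice $\sigma$ is unknown; since it enters every weight through the same factor, it only rescales T when the variance is common to all pairs.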

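We may note that

(a)  Under $H_n^P$,

(9.4)  $$\mu_t(\theta) = E(t) = \sum_{i=1}^{k} c\, \frac{m_i(n)\, n_i(n)}{N_i(n)\, \sigma^2}\, \frac{\xi}{\sqrt{n}},$$

(b)  Under $H_n^L$,

(9.5)  $$\mu_t(\theta) = E(t) = \sum_{i=1}^{k} c\, \frac{m_i(n)\, n_i(n)}{N_i(n)\, \sigma^2}\, \frac{\xi}{\sqrt{n}}\int x\,(1 + \log F(x))\, dF(x),$$

(c)  Under $H_n^C$,

(9.6)  $$\mu_t(\theta) = E(t) = \sum_{i=1}^{k} c\, \frac{m_i(n)\, n_i(n)}{N_i(n)\, \sigma^2}\, \frac{\xi}{\sqrt{n}}\int x\, d[F(x) - G(x)],$$

and

(9.7)  $$\sigma_t^2(0) = \mathrm{var}(t) = \sum_{i=1}^{k} c^2\, \frac{m_i(n)\, n_i(n)}{N_i(n)\, \sigma^2}$$

under $H_n^P$, $H_n^L$ and $H_n^C$.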

10.  The  asymptotic  relative  efficiencies  of  the  tests 

Briefly, the idea of asymptotic relative efficiency is the following:

Suppose that for testing the hypothesis $H_0$ against $H_n$, two tests T and T* require N and N* observations respectively to achieve the same power $\beta$ at the level of significance $\alpha$. Then the asymptotic efficiency of T with respect to T* is defined as

$$\lim_{N\to\infty} N^*/N = e_{T,T^*}(\alpha, \beta, H_0, H_n).$$

We shall be interested in studying the asymptotic efficiency of (i) Elteren's W test relative to an arbitrary Q test against (a) Pitman's shift alternatives and (b) Lehmann's distribution-free alternatives, (ii) Elteren's W test relative to the locally best t test against (a), and (iii) an arbitrary Q test relative to the locally best t test against contaminated alternatives.

The asymptotic relative efficiency of Elteren's W test relative to an arbitrary Q test is stated in the following

Theorem 10.1(a).

If

(i)  for all i, $\lim_{n\to\infty} m_i(n)/n = r_i$ and $\lim_{n\to\infty} n_i(n)/n = s_i$ exist and are positive,

(ii)  the distribution function F is such that

$$\lim_{n\to\infty} \sqrt{n}\int_{-\infty}^{+\infty} \left[J\{\lambda_i F_i(x) + (1-\lambda_i) F_i(x + \xi/\sqrt{n})\} - J\{F_i(x)\}\right] dF_i(x) \Big/ A$$

exists,

(iii)  the hypotheses of lemma 7.1 are satisfied,

then the asymptotic relative efficiency of Elteren's W test relative to an arbitrary Q test for testing the hypothesis $H_0$ against $H_n^P$ is

(10.1)  $$e^P_{W,Q} = 12 A^2 \left(\frac{\int_{-\infty}^{+\infty} f^2(x)\, dx}{\int_{-\infty}^{+\infty} \frac{d}{dx}J[F(x)]\, dF(x)}\right)^2,$$

where f is the density of F.
Proof.

Let n index the sample size for Elteren's W test and n* the corresponding index for the Q test. Furthermore, let the level of significance be fixed at $\alpha$ and the limiting power at $\beta$. Then the W and Q tests will have the same limiting power if

$$\frac{2\sum_{i=1}^{k} \frac{m_i(n)\, n_i(n)}{m_i(n) + n_i(n) + 1}\int_{-\infty}^{+\infty} \left[F(x + \gamma_i + \xi/\sqrt{n}) - F(x + \gamma_i)\right] dF(x + \gamma_i)}{\sqrt{\frac{1}{3}\sum_{i=1}^{k} \frac{m_i(n)\, n_i(n)}{m_i(n) + n_i(n) + 1}}} = \frac{\sum_{i=1}^{k} m_i(n^*)\int_{-\infty}^{+\infty} \left[J\left\{\frac{m_i(n^*)}{N_i(n^*)} F(x + \gamma_i) + \frac{n_i(n^*)}{N_i(n^*)} F(x + \gamma_i + \xi^*/\sqrt{n^*})\right\} - J\{F(x + \gamma_i)\}\right] dF(x + \gamma_i)}{A\sqrt{\sum_{i=1}^{k} \frac{m_i(n^*)\, n_i(n^*)}{N_i(n^*)}}},$$

i.e. if

(10.2)  $$2\sqrt{3\sum_{i=1}^{k} \frac{r_i s_i}{r_i + s_i}}\; \xi\int_{-\infty}^{+\infty} f(x)\, dF(x) = \frac{1}{A}\sqrt{\sum_{i=1}^{k} \frac{r_i s_i}{r_i + s_i}}\; \xi^*\int_{-\infty}^{+\infty} \frac{d}{dx}J[F(x)]\, dF(x),$$

and the same alternatives if $\xi/\sqrt{n} = \xi^*/\sqrt{n^*}$.

Substituting $\xi^* = \xi\sqrt{n^*/n}$ in (10.2) yields the desired result.

It may be remarked that (10.1) agrees with the result found by Chernoff-Savage (1958) and Hodges-Lehmann (1961) for the two-sample problem and by Puri (1962) for the c-sample problem. Hence the efficiency results of that paper, as well as those of the papers mentioned above, apply directly to the present problem.

Proofs of theorems 10.1(b) to 10.1(d) are similar to that of theorem 10.1(a) and are therefore omitted.
Theorem 10.1(b).

If

(i)  for all i, $\lim_{n\to\infty} m_i(n)/n = r_i$ and $\lim_{n\to\infty} n_i(n)/n = s_i$ exist and are positive,

(ii)  the distribution function F is such that

$$\lim_{n\to\infty} \sqrt{n}\int_{-\infty}^{+\infty} \left[J\{\lambda_i F_i(x) + (1-\lambda_i)[F_i(x)]^{1-\xi/\sqrt{n}}\} - J\{F_i(x)\}\right] dF_i(x) \Big/ A$$

exists,

(iii)  the hypotheses of lemma 7.1 are satisfied,

then the asymptotic relative efficiency of Elteren's W test relative to an arbitrary Q test for testing the hypothesis $H_0$ against $H_n^L$ is

(10.3)  $$e^L_{W,Q} = \frac{3}{4} A^2 \left(\frac{1}{\int_{-\infty}^{+\infty} F(x)\log F(x)\, J'[F(x)]\, dF(x)}\right)^2.$$

In particular, when $J = \Phi^{-1}$, where $\Phi$ is the cumulative normal distribution function,

(10.4)  $$e^L_{W,Q}(F(x)) = \frac{3}{4}\left(\frac{1}{\int_{-\infty}^{+\infty} x\log\Phi(x)\, d\Phi(x)}\right)^2 = .927$$

by numerical evaluation.

We may remark that (10.4) agrees with the result found by Puri (1962) for the c-sample problem.

Theorem 10.1(c).

If

(i)  for all i, $\lim_{n\to\infty} m_i(n)/n = r_i$ and $\lim_{n\to\infty} n_i(n)/n = s_i$ exist and are positive,

(ii)  the distribution functions F and G are such that

$$\lim_{n\to\infty} \sqrt{n}\int_{-\infty}^{+\infty} \left[J\left\{\lambda_i F(x+\gamma_i) + (1-\lambda_i)\left[(1 - \xi/\sqrt{n})F(x+\gamma_i) + (\xi/\sqrt{n})G(x+\gamma_i)\right]\right\} - J\{F(x+\gamma_i)\}\right] dF(x+\gamma_i) \Big/ A$$

exists,

(iii)  the hypotheses of lemma 7.1 are satisfied,

then the asymptotic relative efficiency of an arbitrary Q test relative to the locally best T test for testing the hypothesis $H_0$ against $H_n^C$ is

(10.5)  $$e^C_{Q,T} = \frac{\sigma^2}{A^2}\left(\frac{\int_{-\infty}^{+\infty} [F(x) - G(x)]\, J'[F(x)]\, dF(x)}{\int_{-\infty}^{+\infty} [F(x) - G(x)]\, dx}\right)^2.$$

If, in particular, J(u) = u, then the Q test becomes Elteren's W test and we have

(10.6)  $$e^C_{W,T} = 12\sigma^2\left(\frac{\int_{-\infty}^{+\infty} [F(x) - G(x)]\, dF(x)}{\int_{-\infty}^{+\infty} [F(x) - G(x)]\, dx}\right)^2.$$

We may remark that the result (10.6) agrees with the result found by Hodges and Lehmann (1956) for the two-sample problem. Hence their general comments regarding the merits of the performance of the Wilcoxon test relative to the t test against contaminated alternatives carry over to the present situation.

Theorem 10.1(d).

If

(i)  for all i, $\lim_{n\to\infty} m_i(n)/n = r_i$ and $\lim_{n\to\infty} n_i(n)/n = s_i$ exist and are positive,

(ii)  the hypotheses of lemma 7.1 are satisfied,

then the asymptotic relative efficiency of Elteren's W test relative to the locally best T test for testing the hypothesis $H_0$ against $H_n^L$ is

(10.7)  $$e^L_{W,T}(F(x)) = \frac{3}{4}\sigma^2\left(\frac{1}{\int_{-\infty}^{+\infty} x\,[1 + \log F(x)]\, dF(x)}\right)^2,$$

which agrees with the result obtained by the author (1962) for the c-sample problem.

Similarly, it can be shown that

(10.8)  $$e^P_{W,T}(F(x)) = 12\sigma^2\left(\int_{-\infty}^{+\infty} f^2(x)\, dx\right)^2,$$

which is known to be the asymptotic efficiency of the two-sample Wilcoxon test relative to Student's t test. Hodges and Lehmann (1956) have shown that always $e^P_{W,T}(F(x)) \ge 0.864$. In case F(x) is a normal distribution function, this is $3/\pi$.
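The value $3/\pi$ quoted for the normal case can be reproduced by direct numerical integration of $12\sigma^2(\int f^2(x)\,dx)^2$. The sketch below uses a simple midpoint rule; the function names, the integration limits and the number of steps are arbitrary choices made for the illustration.

```python
import math


def wilcoxon_vs_t_efficiency(density, sigma, lower, upper, steps=20000):
    """12 * sigma^2 * (integral of f(x)^2 dx)^2, integral taken by the midpoint rule."""
    h = (upper - lower) / steps
    integral = sum(density(lower + (k + 0.5) * h) ** 2 for k in range(steps)) * h
    return 12.0 * sigma**2 * integral**2


def normal_density(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


print(wilcoxon_vs_t_efficiency(normal_density, 1.0, -10.0, 10.0), 3.0 / math.pi)
```

Any other density can be passed in the same way, provided its standard deviation and an interval carrying essentially all of its mass are supplied.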

We now consider case II. Let $m_i(n) = m_i$, $n_i(n) = n_i$ and suppose that the number, say $\nu$, of pairs of samples tends to infinity in such a way that the limits

$$L = \lim_{\nu\to\infty} \frac{1}{\nu}\sum_{i=1}^{\nu} \frac{m_i n_i}{m_i + n_i + 1}$$

and

$$M = \lim_{\nu\to\infty} \frac{1}{\nu}\sum_{i=1}^{\nu} \frac{m_i n_i}{m_i + n_i}$$

exist. Then, provided the underlying distributions satisfy some general regularity conditions, it can be shown that

(10.9)  $$e^P_{W,T}(F(x)) = 12\sigma^2\, \frac{L}{M}\left(\int_{-\infty}^{+\infty} f^2(x)\, dx\right)^2.$$

As a special case, suppose that N repetitions of k blocks are needed for Elteren's locally best W test and N* for the locally best T test. Then we shall have

(10.10)  $$e^P_{W,T}(F(x)) = 12\sigma^2\, \frac{\sum_{i=1}^{k} \frac{m_i n_i}{m_i + n_i + 1}}{\sum_{i=1}^{k} \frac{m_i n_i}{m_i + n_i}}\left(\int_{-\infty}^{+\infty} f^2(x)\, dx\right)^2.$$

In particular, when $m_i = n_i = 1$,

(10.11)  $$e^P_{W,T}(F(x)) = 8\sigma^2\left(\int_{-\infty}^{+\infty} f^2(x)\, dx\right)^2,$$

which is the asymptotic efficiency of the sign test relative to Student's test, a quantity which is usually expressed as

(10.12)  $$e_{S,t}(F(x)) = 4\sigma^2 f^2(0);$$

see in this connection Hodges-Lehmann (1956) and Noether (1953).

It is interesting to note that the asymptotic relative efficiency (10.10) depends on the number of blocks as well as their sizes. When the sample sizes are equal from block to block, say $m_i = n_i = m$, then the asymptotic efficiency (10.10) depends only on the block size 2m. In the special case where F(x) is the normal distribution function $\Phi(x)$, we have

(10.13)  $$e^P_{W,T}(\Phi(x)) = \frac{3}{\pi}\cdot\frac{2m}{2m+1}.$$

Some values of this expression are tabulated below:

(10.14)
   m                 1     2     3     4     5     6     7     8     9     10    ...   ∞
   e^P_{W,T}(Φ)    .637  .764  .818  .849  .868  .881  .891  .898  .904  .909   ...  .955
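The tabulated efficiencies follow from the closed form $(3/\pi)\cdot 2m/(2m+1)$ and can be regenerated with a few lines. The function name `block_efficiency` is an assumption; the computed values agree with the table to within 0.001.

```python
import math


def block_efficiency(m):
    """(3 / pi) * 2m / (2m + 1): W test versus t test, normal case, blocks of size 2m."""
    return (3.0 / math.pi) * (2.0 * m) / (2.0 * m + 1.0)


for m in range(1, 11):
    print(m, f"{block_efficiency(m):.3f}")
print("limit as m grows:", f"{3.0 / math.pi:.3f}")
```

For m = 1 the expression equals $2/\pi$, the familiar efficiency of the sign test relative to the t test under normality.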

In conclusion, we may mention that the results given here are valid for a large number of replications.

Footnotes

1.  This paper was prepared with the support of the Office of Naval Research (Nonr - 222(^3) and Nonr - 285(38)). Reproduction in whole or in part is permitted for the purpose of the United States Government.

2.  Chernoff and Savage use the symbol $T_N$ instead of $Q_i$.

3.  If $\{x_n\}$ is a sequence of random variables and $\{r_n\}$ a sequence of positive numbers, we write $x_n = o_p(r_n)$ if $x_n/r_n$ tends to zero in probability, or equivalently, if, for each $\varepsilon > 0$, $P\{|x_n/r_n| \le \varepsilon\} \to 1$ as $n \to \infty$.

Résumé

On the combination of independent tests of a general class for two samples.

In this article, tests based on linear combinations of k independent two-sample statistics are analysed. Two classes of such tests are compared, in which the statistics employed are, on the one hand, of the Chernoff and Savage type [1] and, on the other hand, of the "Student" type. Under certain conditions, the coefficients of these linear combinations that give the greatest local asymptotic powers of these tests are obtained. These results are specialized to the case where the alternative hypotheses are the (non-parametric) hypotheses of Pitman or of Lehmann, or weighted means often called "contaminated distributions". Finally, the asymptotic efficiencies of the Q test relative to some of its parametric as well as non-parametric competitors are discussed with respect to the alternatives mentioned above.